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ABSTRACT 

In  this  paper,  we  report  on  the  design  and  implementation  of  a 
system  to  silleviate  the  problem  of  rule  evaluation  for  the  linguist 
in  the  area  of  phonology.  The  system  permits  the  user  to  define, 
on-line,  sets  of  rules  statable  within  the  framework  presented  in 
The  Sound  Patterns  of  English  (N.  Chomsky  and  M.  Halle,  1968),  to 
define  phonemes  as  bundles  of  specified  distinctive  features,  to 
define  data  as  strings  of  phonemes  with  associated  grammatical 
structure,  to  test  the  effect  of  applying  rules  to  the  data,  and 
to  store  both  the  definitions  and  results.  The  system  was  written 
in  BBN  LISP  (Bobrow  et  al,  1967)  on  the  Scientific  Data  System 
940  computer.  The  rule  application  facility  described  in  detail 
later  was  implemented  by  translating  the  linguistic  rules  to  rules 
in  FLIP  (Teitelman,  1967),  a  format  directed  list  processor  em¬ 
bedded  in  LISP.  This  made  the  system  construction  easy  while 
providing  very  sophisticated  capabilities  for  the  linguist.  The 
system  is  designed  to  be  used  on-line  in  Interactive  fashion, 
with  control  returned  to  the  user  after  each  command  is  executed. 
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INTRODUCTION 

Linguistics  has  as  one  of  its  major  goals  the  development  of  a 
theory  of  language  which  is  powerful  enough  to  accurately  and 
precisely  characterize  the  linguistic  facts  of  a  natural  language. 
Currently,  the  most  highly  developed  and  potentially  most  adequate 
theory  of  language  is  that  introduced  by  Chomsky  (1957)  and  in¬ 
volves  the  concept  of  a  transformational  grammar.  Although  this 
theory  has  been  designed  and  modified  to  enable  the  linguist  to 
state  generalization  about  a  language  in  a  simple  and  revealing 
way,  an  account  of  some  significant  portion  of  a  language  often 
results  in  a  complex  and  interdependent  set  of  rules.  Conse¬ 
quently,  it  becomes  more  difficult  for  the  linguist  to  evaluate 
the  work  he  has  done.  In  fact,  linguists  have  reached  the  point 
today  where  the  detail  of  analysis  makes  it  impracticable  to 
evaluate  by  hand  even  a  small  set  of  rules. 

In  this  paper,  we  report  on  the  design  and  implementation  of  a 
system  to  alleviate  the  problem  of  rule  evaluation  for  r,he  linguist 
in  the  area  of  Dhonology.  The  system  permits  the  user  to  define, 
on-line,  sets  of  rules  statable  within  the  framework  presented  in 
The  Sound  Patterns  of  English  (N.  Chomsky  and  M.  Halle,  1968), 
define  phonemes  as  bundles  of  specified  distinctive  features,  to 
define  data  as  strings  of  phonemes  with  associated  grammatical 
structure,  to  test  the  effect  of  applying  rules  to  the  data,  and 
to  store  both  the  definitions  and  results. 

The  system  is  written  in  BBN  LISP  (Bobrow  et  al.,  1967)  on  the 
Scientific  Data  System  9^0  computer.  The  rule  application 
facility,  described  in  detail  later,  is  implemented  by  translating 
the  linguistic  rules  into  rules  in  FLIP  (Teitelman,  1967),  a  format 
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directed  list  processor  embedded  in  LISP.  This  makes  the  system 
construction  easy  while  providing  very  sophisticated  capabilities 
for  the  linguist.  The  system  is  designed  to  be  used  on-line  in  an 
interactive  fashion,  with  control  returned  to  the  user  after  each 
command  is  executed. 

We  point  out  that  this  system  has  been  designed  to  provide  a 
phonological  rule  testing  capability  as  opposed  to  a  syntactic 
rule  testing  system,  several  of  which  have  developed  elsewhere. 
(See  Blair,  1966;  Friedman,  1967;  Gross,  1967;  and  Londe  and 
Schoene,  1967)  However,  because  of  the  modular  way  in  which  our 
system  has  been  designed,  it  can  be  made  applicable  to  syntactic 
rules  by  extension  rather  than  redesign. 


-2- 
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Definitional  Capabilities 


Within  the  framework  of  generative  grammar,  the  role  of  the 
phonological  component  is  to  interpret  the  output  of  the  syntactic 
component  and  convert  this  into  an  appropriate  phonetic  represen¬ 
tation.  Thus,  the  linguist  in  formulating  a  phonologic?  .  rule  is 
concerned  first  with  identifying  a  relevant  phrase  marker — a  tree 
structure  having  phonemes  and  grammatical  markers  as  the  symbols 
of  its  terminal  string — and  then  in  converting  this  string  Into 
a. phonetic  representation.  For  example,  a  linguist  writing  a  set 
of  rules  to  account  for  the  assignment  of  the  correct  phonetic 
form  for  English  noun  plurals  is  concerned  with  relating  the 
following  types  of  strings 


bite" 

[bayt ]+PL 

[bayts] 

tiff" 

[tif]  -JPL 

[tifs] 

lid" 

[lid]+PL 

[lidz] 

love" 

[lov]+PL 

[lavz] 

fish" 

[fis]+PL 

[fisiz] 

buzz" 

[baz]+PL 

[b  z4z] 

As  the  examples  show,  there  are  three  plural  endings,  [sj,  [z], 
and  [4z].  Examination  of  English  noun  plurals  quickly  reveals 
that  plural  assignment  is  not  at  all  random  but  depends  solely  on 
the  phonetic  form  of  the  last  phone  in  the  noun  singular  form. 

The  following  three  phonological  rules  generate  the  correct  plural 


endings . 

P-  vocalic  [ 

R1 

PL-^[4z] 

/ 

1  +strident|  . 

L-grave  J 

R2 

PL -Ms] 

/ 

t-volce]  _ 

R3 

PL— Mz] 

-3- 
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These  three  rules  are  ordered  such  that  the  rule  R1  is  tried  first 

to  any  noun-fPL  sequence.  R1  states  that  the  plural  morpheme,  PL, 

is  converted  into  the  phonetic  form  [4z]  if  it  (the  PL)  follows  a 

phone  which  is  non-vocalic  (all  consonants  except  "1"  and  "r"),  is 

strident  (phones  with  a  hissing  or  hushing  sound)  and  is  non-grave 

(phones  which  do  not  have  a  place  of  articulation  at  the  front 

(e.g.,  "f",  "v",  "p")  or  the  back  ("k",  "g")  of  the  mouth.  R2 

states  that  PL  is  converted  into  the  phone  [s]  after  any  non- 

voiced  phone.  Since  R1  has  already  been  tried,  it  may  be  the  case 

that  R2  cannot  apply  because  the  plural  marker  has  already  been 

converted.  For  example,  R1  when  tried  to  the  form  [fis]+PL  is 

found  to  be  applicable  and  the  PL  is  converted  into  [4z].  If  this 

conversion  had  not  occurred,  R2,  when  tried,  would  have  been  found 

applicable  and  have  converted  PL  into  [s],  thus  causing  the 

/ 

unacceptable  plural  noun  [fus].  R3  is  tried  after  both  R1  and  R2 
and  will  be  applicable  in  case  neither  of  the  first  two  rules  has 
been  applied,  since  R3  simply  converts  PL  into  the  phone  [z],  A 
discussion  of  this  set  of  rules  as  well  as  a  very  insightful 
alternative  can  be  found  in  Keyser  and  Hal.le,  1968.  For  a  very 
thorough  and  detailed  analysis  of  the  phonological  component  of  a 
generative  grammar  and  English  pnonology,  the  reader  is  referred 
to  Chomsky  and  Halle,  1968. 

For  a  phonological  rule  system  to  satisfactorily  simulate  the 
effect  of  a  set  of  phonological  rules  we  need  at  least  three 
definitional  capabilities:  for  phonemes;  for  P-markers  (or  trees, 
as  we  shall  henceforth  refer  to  them);  and  for  rules.  We  now 
examine  these  definitions  in  that  order. 


Report  No.  1589 


Bolt  Beranek  and  Newman  Inc 


In  keeping  with  the  framework  of  Chomsky  and  Halle,  1968,  a 
phoneme  is  defined  as  a  set  of  distinctive  features.  Each  dis¬ 
tinctive  feature  (e.g.,  vocalic,  consonantal,  strident,  fricative) 
has  an  associated  specification  which  is  normally  marked  either 
Plus  (+)  or  minus  (-),  but  which  in  the  case  of  the  prosodic  fea¬ 
tures  of  stress  and  intonation  may  take  a  numerical  value  such  as 

0 »  1>  2 ,  3,  •••, 

Within  our  phonetics  system,  a  phoneme  is  represented  by  a  list 
of  just  those  features  which  are  marked  plus  (or  have  other  non- 
negative  value).  Thus  a  phoneme  /A/  which  was  marked  positive 
for  the  features  BACK,  LOW,  and  VOC  (vocalic),  and  a  STRESS 
value  of  2  would  be  represented  as  the  list 

(VOC  BACK  LOW  STRESS  2) 

Numerical  values  of  features  immediately  follow  the  feature 
names;  all  features  in  the  system  not  included  in  the  list 
defining  the  phoneme  are  assumed  marked  minus,  including  STRESS. 


Phoneme  names  in  the  system  can  be  any  string  of  characters  not 
containing  parentheses,  brackets,  commas  or  spaces.  For  the 
most  part,  it  is  possible  to  use  the  orthography  normally  found 
in  the  linguistic  literature  with  certain  exceptions  due  to  the 
teletype  character  set.  For  example,  the  phoneme  ///  might  be 
rendered  as  SH  —  all  teletype  characters  are  printed  in  upper 
case  —  /ae'  might  be  rendered  as  AE,  etc. 


Finally,  we  remark  that  no  provision  has  been  made  within  the 
system  to  differentiate  between  phoneme  and  phone  specification. 
The  user  must  define  phones  using  the  same  instruction.  This 
results  in  a  list  of  phonemes  defined  to  include  both  those  sets 
of  specified  distinctive  features  which  are  interpreted  by  the 
linguist  as  phonemes,  and  those  interpreted  only  as  phones. 

There  appears  to  be  no  difficulty  in  combining  the  two  types  of 
entities  and,  as  we  will  see  below,  it  permits  a  much  more 
efficient  output  of  the  steps  of  a  derivation. 

*  In  all  examples  we  refer  to  segments  as  having  +,  -,  or 
numerical  specifications  although  within  the  system  a  segment 
contains  only  the  names  of  features  for  which  it  is  non-negatively 
specified. 


a* 
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Tree  Definition 


Phonological  rules  operate  on  trees  rather  than  phonemes  in 
isolation.  These  trees  are  represented  in  our  system  by  lists; 
as  an  example,  the  tree  [bayt]+PL  discussed  earlier  is  represented 
as 

((N)  3  AT  T  +  PL) 

To  define  a  tree  in  the  system,  the  user  types 

DTREE  <Name>  ^Tree  Definition> 

where  the  syntax  for  tree  can  be  represented  as 

<Tree>  =  ((<Syntactic  Marker>n)  <Segment>n  ) 

((<Syntactic  Marker>n)  <Tree>n) 

We  have  user  here  an  extended  form  of  the  standard  BNF. 

The  superscript  n  following  a  name  Indicates  that  one  or  more  of 
the  items  may  occur.  A  <Seginent>  is  either  a  phoneme,  specified 
by  its  phoneme  name  or  its  phoneme  definition,  or  a  non-phonetic 
atomic  symbol  (such  as  H  above)  which  is  used  as  a  marker.  The 
phoneme  definition,  (the  list  of  positively  specified  features) 
is  used  in  place  of  the  phoneme  name  in  the  internal  representation 
of  a  tree. 

A  tree  is  a  list  containing  first  a  syntactic  marker,  followed  by 
the  components  which  make  up  this  syntactic  entity.  Often  the 
syntactic  marker  is  composed  of  more  than  just  a  single  category 
such  as  N  (noun).  For  example,  if  an  English  noun  were  marked  for 
gender,  the  noun  "bite"  could  be  represented  as: 


( (N  NEUT)  B  AY  T) 
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Note  that  any  information  concerning  rule  exception  features 
would  be  part  of  the  syntactic  marker. 

A  second  type  of  tree  definition  is  used  to  create  a  set  of  data. 
Typing 

CTREE-  T0  (ALL  T1  T2  .  .  .  TN) 

defines  T0  as  the  set  of  trees  Tl,  T2,  ...  TN.  Any  rule  applied 
to  T0  is  applied  to  all  these  trees  in  sequence.  This  naming 
schema  is  very  useful  when  the  data  remain  constant  but  the  rule 
definitions  are  being  altered. 

Rule  Definition 

A  phonological  rule  identifies  a  small  substring  of  a  phonemic 
string;  If  applicable  to  a  given  tree,  the  rule  effects  some 
change  in  this  substring,  for  example,  deleting  part  of  it,  or 
adding  a  phoneme  to  It.  A  rule  is  defined  within  the  system  by 
typing 

DRULE  <Name>  <Rule  Definition> 

The  form  of  rule  definitions  in  our  system  closely  parallels 
that  found  in  current  linguistic  literature,  both  in  terminology 
and  notation.  Certain  differences  arise  because  of  the  limited 
character  set  of  the  teletype  and  because  of  certain  assumptions 
underlying  the  characterization  of  a  rule.  These  will  become 
clear  in  the  following  discussion. 


. . . — Tim . . . . . . . mm. 
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We  distinguish  three  types  of  phonological  rules  within  the 
system:  a  simple  rule,  an  insertion  rule  and  a  string  rule.  It 
is  convenient  to  think  of  each  rule  as  consisting  of  a  left  hand 
side  (LHS)  which  specifies  the  condition  on  the  substring  to  be 
altered,  a  rignt  handed  side  (RHS)  which  specifies  the  change  to 
made  and  a  context  which  specifies  the  environment  in  which 
the  substring  matched  by  the  LHS  must  be  located. 

Simple  Rule 


A  simple  rule  has  the  form  (<segmentxsegment><context>) 
he  LHS  of  a  simple  rule  specifies  a  single  segment  which  is 
identified  by  either  a  phoneme  name  (e.g..  A),  an  undefined 
symbol  name  (a  non-,  honetic  symbol,  e.g.,  #)  a  bundle  of 
specified  features  (e.g.,  (+  voc  .  CNS)),.  The  fiHS  of  ,  , 

ru  e  also  specifies  a  single  segment,  identified  by  one  of  the 
three  forms  of  the  LHS  or  by  the  symbol  0  which  indicates  deletion 
of  the  LHS  element.  A  segment  of  a  tree  is  matched  by  the  LHS  of 
a  simple  rule  under  any  of  the  following  conditions: 

1)  if  the  LHS  is  a  phoneme  name  and  the  segment  has  the 
same  name ; 

2)  if  t.ie  LHS  is  a  non-phonetic  symbol  and  the  segment  is 
this  same  symbol; 

3)  or  if  the  LHS  is  a  bundle  of  specified  features  and  the 
segment  contains  corresponding  feature  specifications 
for  all  features  specified. 

All  feature  specifications  must  be  separated  from  the  feature 
name  by  a  space.  A  parenthesis  has  the  value  of  one  space. 


-9- 
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Every  segment  of  the  tree  matched  by  the  LKS  of  a  rule  Is  marked, 
not  Just  the  first  one  encountered.  Although  In  defiling  a  phoneme 
a  feature  specification  for  a  phoneme  may  have  only  a  +,  or 
numerical  value  in  a  rule,  we  also  permit  the  values  <Name>, 

(<Name>)  and  (-<Name>)  to  occur  in  a  phoneme  specification.  These 
named  values  function  as  the  a,  B,  y  .specification  in  the  literature; 
that  is,  as' variables  whose  values  are  equal  or  not  equal  to  other 
variables  having  the  same  name.  For  example,  a  segment  specifica¬ 
tion  (X  VOC),  matches  any  phoneme,  but  the  system  associates  with 
the  name,  X,  the  value  of  the  specifications  of  the  feature  VOC 
in  this  phoneme.  If  the  specification  (X)  is  encountered  later 
(to  the  right  in  the  string  pattern  used  in  the  match),  the  asso¬ 
ciated  value  of  X  is  used  in  matching  the  current  phoneme.  Only 
if  the  value  of  the  two  specifications  are  identical  does  the 
second  segment  match  the  string,  assuming  all  other  requirements 
are  met.  Thus,  the  two  segments 

(X  VOC  -  FRT)  (+  CNS  (X)  FRT) 


would  match  the  substring  consisting  of 
(+  VOC  -  FRT)  (+  CNS  +  FRT) 
but  not  the  substring 
(+  VOC  -  FRT)  (+  CNS  -  FRT) 

Note  that  the  value  of  x  was  picked  up  from  the  feature  VOC,  but 
used  to  match  with  the  feature  FRT  in  this  example. 


-10- 
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The  specification  (-<Name>)  is  interpreted  similarly,  but  indi¬ 
cates  that  the  value  of  the  second  specification  must  be  different 
from  that  of  the  first. 


A  marked  segment  in  a  tree  is  changed  in  the  following  way: 

1)  if  the  RHS  of  the  simple  rule  is  a  phoneme  name  or  non- 
phonetic  symbol  this  item  replaces  the  marked  segment 
in  the  P-marke1" 

2)  if  the  RHS  is  a  list  of  phoneme  names  and/or  non-phonetic 
symbols  (but  not  a  bundle  of  specified  features)  smarting 
with  a  colon,  e.g.,  (:  A  B)  all  these  items  are  inserted 

for  the  LHS 

3)  if  the  RHS  Is  a  bundle  of  specified  features,  the 
marked  segment  is  changed  to  reflect  that  set  of 
specified  features 

4)  if  the  RHS  is  0  the  marked  segment  is  deleted 

Marking  of  all  identified  segments  is  done  first  and  then  all  the 
changes  are  made. 


The  following  have  been  constructed  to  illustrate  these  cases. 

(in  the  format  to  be  typed  to  the  system) : 

fiu^e  Comment 

DRULE  R1  ((+  VOC  +  VOICE)  (-  VOICE))  Every  segment  marked 

(+  VOC  +  VOICE)  is  now 
marked  (-  VOICE) 

DRULE  R2  (  0  (-  VOC))  Every  occurrence  of 

phoneme  0  is  marked 
(-  VOC)) 


DRULE  R3  (A  E) 
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Each  occurrence  of  a 
phoneme  segment  A  is 
replaced  by  an  E) 
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DRULE  R4  (#  (+  SEG)) 


DRULE  R5  ((+  VOC)  0) 


The  nonphonetic 
symbol  #  Is  replaced 
by  a  phonetic  segmenc 
with  just-  feature  SEG 
marked  +) 

Every  segment  marked 
(+  VOC)  Is  deleted) 


DRULE  R6  (E  (:  A  R))  All  occurrences  of  the 

phoneme  E  are  replaced  by 
the  sequences  of  two 
phonemes  A  and  R 


The  simple  rules  shewn  above  operate  on  all  occurrences  of  an  Item 
In  a  phoneme  string  matching  the  LHS  of  the  rule.  However,  the  user 
can  restrict  the  domain  of  this  change  by  specifying  a  <context> 
for  the  LHS  for  which  the  rule  is  applicable.  The  <context>  is 
stated  In  the  rule  definition  by  following  the  LHS  and  RHS  by 

/<Left  Context>  —  <Rlght  Context> 

where  ”  —  ”  marks  the  position  of  the  LHS  in  this  context.  Either 
or  both  contexts  may  be  empty. 

The  LHS  is  Inserted  in  the  context  for  the  — .  This  sequence  of 
<Le ft  Context>  -LHS-<Right  Context>  can  be  considered  a  pattern 
which  will  match  a  substring  of  the  phoneme  string  if,  and  only 
If,  each  individual  elementary  pattern  matches  consecutively  a 
segment  of  the  phoneme  string.  We  discuss  these  elementary 
patterns  below.  The  implementation  of  the  matching  process 
utilizes  this  entire  string  pattern,  with  only  one  distinction 
made  for  the  LHS  pattern;  a  special  mark  is  inserted  before  LHS 
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pattern  to  save  its  matching  position  in  the  phoneme  string  if  a 
complete  match  is  successful.  Matching  substrings  are  always 
found  in  left  to  right  order.  This  is  important  to  remember  in 
utilizing  "named"  feature  specifications,  as  mentioned  earlier, 
and  in  similar  naming  conventions  discussed  later. 

The  rule  R7,  defined  by  typing: 

DRULE  R7  ((+  VOC)(+  STRESS)  /  (+  VOC  -  STRESS)(+  CNS)  —  (+  CNS)) 

causes  any  vowel  segment  to  become  stressed  which  immediately 
follows  an  unstressed  vowel  and  a  consonant  and  immediately  pre¬ 
cedes  another  consonant. 

Note  that  when  R7  is  applied  to  a  phoneme  string  of  alternating 
unstressed  vowels  and  consonants,  all  vowels  but  the  first  will 
be  made  (+  STRESS)  since  charges  are  made  only  after  all  sub¬ 
strings  matching  the  string  patterns  have  been  found.  However, 
by  replacing  — ,  the  LHS  position  mark,  by  ++,  one  can  specify 
that  the  change  is  to  be  made  in  the  phoneme  string  as  soon  as 
each  match  is  found.  In  this  case,  the  result  would  have  only 
the  second,  fourth,  ...,  vowel  becoming  stressed. 

Finally,  if  a  rule  has  been  defined  as  above,  contexts  may  over¬ 
lap;  that  is,  the  string  of  segments  in  the  tree  identified  as 
part  of  an  acceptable  right  context  for  an  occurrence  of  the 
LHS  of  the  rule  may  function  as  the  left  context  for  another 
occurrence  of  the  LHS.  To  avoid  having  overlapping  of  context 
the  user  can  use  //  instead  of  /  in  the  rule  definition.  In  the 
following  rule 

DRULE  R8  ((+  VOICE)  (+  HIGH)  //  (+  VOICE  )  — ) 
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only  alternating  elements  of  a  string  of  voiced  segments  will  be 
made  (+  HIGH)  since  the  LHS  is  prevented  from  acting  as  a  left 
context  by  the  //. 


In  addition  to  the  three  types  of  segments  which  can  make  up  the 
LHS  of  a  rule,  there  are  a  number  of  other  elementary  patterns 
which  can  be  used  in  the  specification  of  the  context  or  the  RHS 


1)  (@  <Synmk>1. . . <Synmk>n) 


This  elementary  pattern  may  only 
be  the  first  element  in  the 
context  and  specifies  that  this 

rule  is  applicable  only  to  a  tree 
(phoneme  string)  marked  with  all 
the  syntactic  markers 

<Synmk>1. . . <Synmk>n 
(but  perhaps  others  too). 


2)  $ 


3)  (!  <Name><Specfeat>n) 

(=  <Name><Specfeat>n) 


Equivalent  to  the  variable  X  in 
linguistic  notation.  Will  match 
any  number  of  elements  of  the 
substring  (including  zero,  which 
is  tried  first). 

Used  in  conjunction  with  each 
other.  ( !  <Name><Specfeat>n) 
matches  any  segment  having  the  n 
specific  features  listed  and 
associates  <Name>  with  this  seg¬ 
ment.  (f  <Name>)  associates 
<Name>  with  any  segment. 

(■  <Name><Specfeat>n)  matches  any 
segment  having  exactly  the  same 
feature  specification  as  the  seg- 
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ment  already  associated  with 
<Name>,  except  for  the  specified 
features  listed.  (=<Name>) 
matches  a  segment  identical  to  the 
one  already  associated  with 
<Name>.  The  pattern 
(t  X  A  STRESS)  (■  X  (-A)  STRESS), 
for  example,  matches  any  two 
phonemes  which  are  identical  ex¬ 
cept  for  the  value  of  the  feature 
STRESS . 

It  is  important  to  recognize  that 
because  a  string  pattern  is 
examined  by  the  system  in  a  left 
to  right  order,  the  user,  in 
formulating  a  rule  utilizing  this 
naming  convention,  must  ensure 
that  the  associating  of  the  name 
X,  (!  X...),  occurs  in  the  pattern 
to  the  left  of  a  pattern  which 
requires  the  associated  value  of 
X,  i.e.,  (=  X...).  The  first 
elementary  pattern  (!  X...) 
calls  attention  to  a  segment  and 
associates  the  name  X  with  it; 
the  second  only  compares  the 
composition  of  the  segment  to  the 
one  already  associated  with  the 
name  X,  taking  into  account  the 
differences  indicated.  If  the 
order  of  the  patterns  is  reversed, 
the  rule  will  never  apply  to  any 
string. 
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The  use  of  (a  <Name><Specfeat>n) 
in  the  RHS  should  be  obvious  from 
the  preceding  discussion.  A 
segment  (=  X  -  BACK)  in  the  RHS 
causes  a  copy  of  the  segment 
associated  with  X  to  be  used  on 
the  RHS,  with  the  feature  BACK 
specified  negative.  The  rule  R9 

((+  VOC)(»  X  -  STRESS)  /  (!  X  +  VOC  +  STRESS)  —  ) 

replaces  a  vowel  following  a 
stressed  vowel  bj  an  identical, 
unstressed  copy  of  that  vowel. 

i|)  (EITHER  <Segment>1  OR  <Segment>2  OR  ...) 

Used  to  indicate  that  the  segment 
matched  in  the  string  may  be 
either  that  specified  by 
<Segment>1  or  by  <Segment>2,  etc. 
One  of  the  options  must  be  matched. 
Note  that  <Segment>^  etc.  may 
contain  any  number  of  subpatterns 
e.g.,  (EITHER  (+  VOC)(-  VOC)  OR 
(#  (-  VOC))). 

This  disjunctive  specification 
may  be  used  as  well  to  designate 
possible  syntactic  markers  of  a 
tree.  The  specification 
(EITHER  (g  N  PL)  OR  (g  ADJ ) ) 
immediately  following  the  slash,  /, 

% 
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5)  (OPT  <Segment>) 


restricts  the  application  of  the 
rule  to  trees  having  either  the 
syntactic  markers  N  and  PL>  or  one 
having  the  marker  ADJ. 

Used  to  indicate  a  possible  but 
not  required  occurrence  of 
<Segment>.  The  <Segment>  may  be 
a  simple  segment  as  described  in 
the  discussions  of  the  LHS  of  a 
simple  rule  or  may  be  compound, 
built  up  out  of  the  basic  patterns 
we  are  presently  discussing.  The 
alterative  with  the  <Segment> 
present  is  tried  first.  The 
specification 

(+  VOC)( OPT (EITHER  (+  VOC)  OR 
(-  NAS) ) ) (+  CNS) 
matches  a  segment  marked  as 
(+  VOC)  followed  optionally  by  a 
segment  marked  either  as  (+  VOC) 
or  (-  NAS)  all  followed  by  a  seg¬ 
ment  marked  (+  CNS). 


6)  ( #<Num> <Num> <Segment > )  Used  to  specify  a  number  of 

successive  occurrences  of 
<Segment>.  The  first  <Num> 
indicates  the  lower  bound,  the 
second  the  upper  bound.  If  only 
one  <Num>  is  present,  it  is  inter¬ 
preted  as  the  lower  bound  with 
the  upper  one  indefinite.  If  no 
<Num>  precedes  <Segment>  the  case 
(#<Segment>) — the  <Segment>  need 
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not  occur,  one  segment  may  occur, 
two  <Segments>  may  occur  .... 

Note  that  the  pattern  (.OPT 
<Segment>)  is  equivalent  to 
(#07  <Segment>).  As  an  example, 
the  context  /(#  1  3  (  +  VOC))  — 
requires  that  the  LHS  occur  after 
at  least  one  but  no  more  than 
three  segments  marked  (  +  VOC). 

7)  (SIDE  <Pred>)  Used  to  place  conuitions  on  fea¬ 

tures  like  stress  which  may  have  a 
numerical  specification.  We  allow 

<Pred>  to  be  either  an  elementary 
predicate,  or  a  boolean  combination 
of  predicates  using  (NOT<Pred>), 
(AND<Pred>1  ...  <Pred>n)  and 
(OR<Pred>^  ...  <Pred>n).  The  two 
elei.^ntary  predicates  available 
are:  (N=  <VAL>1  <Val>2  interpreted 
as  "not  equal"  of  the  two  <Val>s 
(which  must  be  names  used  in 
specifications);  and 
(SLE<Val>1  <Val>2)  interpreted  as 
<Val>^  is  a  Stress  Less  than  or 
Equal  to  <Val>2»  As  an  example 
of  a  side  condition,  consider  a 
context 

/(A  STRESS)  (#  (+  CNS) )  (B  STRESS)  — (SIDE  (N=  A  B)) 

This  context  requires  that  the 
LHS  match  a  substring  only  if  it 
follows  two  vowels  of  unequal 
stress  which  are  separated  by 
any  number  of  consonants. 
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Insertion  Rule 


A  second  type  of  phonological  rule  is  the  insertion  rule,  which  is 
flagged  by  having  a  LHS  which  is  just  the  symbol  0  (zero).  This 
marker,  0,  matches  a  null  segment  every  place  in  the  string  which 
satisfies  the  context  (before  every  segment  if  no  context  is 
given).  This  null  segment  is  the  one  modified  by  the  RHS.  Thus, 
if  the  RHS  consists  of  a  single  phoneme,  the  effect  of  the  rule 
Is  to  insert  this  phoneme  in  the  place  specified  by  the  context. 

As  an  example: 

DRULE  RIO  (0  (:  +  A)  /  (g  N)  — #) 

will  cause  the  sequence  +  A  to  be  inserted  at  the  end  of  every 
tree  having  the  syntactic  marker  N. 

String  Rule 

In  order  to  allow  a  rule  to  change  more  than  one  segment  In  a 
string,  we  introduce  a  string  rule.  In  this  rule,  the  RHS  is  a 
list  starting  with  ''•"followed  by  any  number  of  RHS  formats. 

Each  succeeding  RHS  format  effects  a  change  on  a  successive 
element  of  the  phoneme  string  (with  two  exceptions  described 
below).  The  change  starts  at  the  position  matched  by  the  LHS 
element.  The  LHS  may  Itself  specify  more  than  one  element  as  a 
list  starting  with  however,  all  but  the  first  can  equally 
well  'e  put  in  the  right  context.  For  example: 

DRULE  Rll  ((*(+  VOC)  (+  VOICE))  (*  (+  STRESS)  0)) 

and 

DRULE  R12  ((+  VOC)  (*(+  STRE SS)0)  /  —  (+  VOICE)) 
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have  the  identical  effect  of  stressing  the  first  and  deleting 
the  second  of  any  sequence  of  segments  matching  (+  VOC)(+  VOICE). 

A  special  RHS  element  which  may  only  appear  in  a  string  rule  is 
(0  <Segment>)  which  inserts  the  segment  specified  by  <Segment> 
before  the  segment  in  the  phoneme  string  corresponding  to  this 
elementary  RHS  pattern. 

The  specification  *1  may  be  used  on  the  RHS  of  a  string  rule  to 
indicate  no  change  to  the  corresponding  segment  in  the  phoneme 
string.  For  example,  the  rule 

DRULE  R15  ( ( *A(+  CNS  -  “PUT  +  STRID)  B)(*(-  STRESS)  *1  0)) 

marks  the  A  as  (-  STRESS),  leaves  the  second  segment  as  is, 
and  deletes  the  third  segment,  B. 

Another  segment  specification,  *M,  can  be  used  on  a  RHS  to  affect 
more  than  one  item.  The  *M  stands  for  metathesis  and  has  the 
following  interpretation.  The  segment  of  the  P-marker  matched 
by  the  LHS  pattern  corresponding  to  the  *M  is  permuted  with  the 
following  segment.  In  order  to  alter  either  of  these  permuted 
elements,  the  form  (*M  <3egment>1  <Segment>2)  is  used,  where 
<Segment>1  and  <Segmer.t>2  correspond  to  the  segments  in  the 
permuted  order,  not  the  original  order  in  the  P-marker.  The  rule 

DRULE  R13  ((*  A  B)  *M) 

permutes  all  occurrences  of  A  and  B;  and  the  rule 
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DRULE  RllJ  (R  (*  #1  («M  (+  PRT) ) ) ) 


permutes  any  two  elements  following  an  R  and  marks  the  one  that 
becomes  adjacent  to  the  R  (+  FRT).  Nothing  is  done  to  the  segment 
which  originally  followed  the  R. 

Sequencing  Rules 

In  the  preceding,  we  have  discussed  the  types  of  phonological 
rules  and  their  specifications.  Now  we  introduce  a  notation  for 
sequencing  rules.  The  command 

DRULE  R0  (ALL  R1  R2  R3  ...  RN) 


has  the  effect  of  defining  the  rule  R0  as  the  list  of  rules 
(R1  R2  R3  ...  RN)  with  the  following  interpretation.  When  R0  is 
applied  to  some  tree  Tl,  first  the  rule  R1  is  applied  to  T1  and 
the  result,  T1R1,  is  either  a  new  tree  (in  case  R1  was  applicable) 
or  Tl  Itself.  Then  R2  is  applied  to  Tl^,  then  R3  to  the  result 
T1R1  R2  ,and  30  °n  until  RN  has  applied  to  T1R1  R2  _ 

Essentially,  this  (ALL  ...)  form  of  a  rule  definition  allows  the 
user  to  define  a  set  of  rules  and  cause  them  to  be  applied  in 
succession . 


The  instruction 

DRULE  R00  (ANY  R1  R2  ...  RN) 

has  a  slightly  different  interpretation.  As  before,  R1  will  apply 
first  to  Tl,  then  R2,  etc.  but  the  first  time  some  rule  is 
applicable  the  operations  are  carried  out  and  the  application  of 
R00  is  finished.  For  example,  if  R00  =  (ANY  R1  R2  R3)  and  if  R2 
were  applicable,  R3  would  never  be  applied  to  T1R1  R2,  Note  R1 
could  not  have  been  applicable  or  R2  would  not  have  been  tried). 
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Any  of  the  rule  names  within  an  (ALL  ...)  or  (ANY  ...)  form  can 
be  one  of  these  forms.  Thus  the  command 

DRULE  R000  (ALL  R1  (ANY  R2  R3)  Ri* ) 

is  a  well  formed  rule. 

Testing  Capabilities 

There  are  two  testing  modes  in  the  system:  TEST  and  WTEST. 
These  stand  for  test  and  watch  test,  respectively.  Suppose  we 
have 


Rl  =  (*  CNS  -  CONT  -  VOICE) 

<*  CONT  ♦  VOICE) 

/  (  +  VOC)  --  <+  VOC) 

T1  «  <<N)  #  P  A  P  A  #) 

where  Rl  is  a  rule  whi^h  makes  a  voiceless,  non-continuent 
consonant  voiced  and  continuent  in  intervocalic  position.  The 
instruction  TEST  Rl  T1  will  cause  the  system  to  respond  with 


TESTING 

Rl 

CCN)  4  P  A  P  A  4) 

(CM)  #  P  A  P  A  f) 
CCN)  #  P  A  B  A  #> 


rfhere  the  last  line  is  the  result. 

*  The  rules  and  data  in  this  section  are  adapted  from  Rogers,  1967. 
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If  the  result  of  a  TEST  command  contains  some  segment  composed 
of  a  bundle  of  specified  features  which  has  no  phoneme  name, 
this  bundle  of  specified  features  will  be  printed  instead  of  a 
single  equivalent  name  since  none  has  been  defined.  It  is  for 
this  reason  that  we  require  that  phonemes  and  phones  be  defined  in 
the  same  way  and  not  be  distinguished  formally  within  the  system. 

The  command  WTEST  provides  the  added  feature  that  the  result  of 
each  step  of  the  derivation  is  shown  to  the  user.  This  is  most 
useful  in  tracking  down  exactly  where  a  set  of  rules  is  producing 
unexpected  results.  For  example,  suppose  we  have  the  following 
rules : 


1. 


+CONT  ' 

-CONT 

-VOICE 


f+CONT  1 
L+VOICE J 


/  [+VOC]  —  [+VOCT) 


2 .  [+VOC] 


[+STRESS]  /  \ 


#  G-CONS]  —  J+UONSJ*  [+VOC] 
[+CONS](+VOC]  [+CONS]2  — 


3.  X 


-*>  [-VOICE]  /  J 


' 

+VOC 

f+CONS  1 

.-STRESS 

L- voice] 

r~  1 

r+voc  ] 

L+CONSJ 

L-VOICEJ 

r+voc  i 

l+VOICEJ 
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The  statement  of  these  rules  In  our  notation  is  the  following: 


R0  ■  CALL  R1  R2  R3  R4  R5> 

R1  s  (+  CNS  -  CONT  -  VOICE) 

(♦  CONT  ♦  VOICE) 

✓  C+  VOC)  —  <♦  VOC) 

R2  «  CANY  R2A  R2B) 

R2A  *  C+  VOC) 

C+  STRESS)  „ 

/  0  <*  CNS)  --  C #  I  2  C+  CNS))  C*  VOC)  0 

R2 6  ■  C+  VOC) 

C+  STRESS) 

/  0  i*  CNS)  C+  VOC)  COPT  C#  I  2  (♦  CNS))) 


3  |E  fg 
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R3  *  (ALL  R3A  R30  R3C> 


R3A  *  (♦  SEU) 

C-  VOICE) 

/  --  # 

R3d  »  '♦  VOC  -  STRESS) 

<-  VOICE) 

/  —  <#  2  2  <♦  CNS  -  VOICE))  <♦  VOC  ♦  VOICE) 

R3C  ■  <♦  CNS) 

<-  VOICE) 

/  —  <♦  VOC  -  VOICE) 

R4  ■  (ALL  R4A  R4tJ> 


R4A  ■  (♦  SEti) 

(-  VOC  -  VOICE) 

✓  (♦  VOC)  —  # 

R4ri  *  (♦  SE(i) 

0 

/  (♦  VOC  -  VOICE)  —  (♦  CNS  -  VOICE) 

R5  ■  (*  (I  X  A  STRESS)  (■  X  (-  A)  STRESS)) 
(*  0  (■  X  ♦  LONtj  ♦  STRESS)) 
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The  command 


WTEST  R0  T1 


will  cause  the  following: 


testing 

R0 

CCN>  #  P  A  P  A  #> 

<CN>  #  P  A  P  A  #> 

R0 

R1 

CCN>  #  P  A  B  A  #> 

R2 

R2A 

CCN)  0  P  A*  8  A  #> 

R3 

R3A 

<<N>  #  P  A*  8  A-  #> 

R38 

R3C 

<<N>  §  P  A*  8-  A-  #> 
R4 

R4A 

R48 

RS 

CCN>  #  P  A«  8-  A-  #> 


The  Initial  two  lines  contain  the  original  test  Items.  Each  rule 
name  Is  printed  out  before  It  Is  applied,  and  if  it  is  applicable 
the  changed  tree  is  printed.  In  this  case  one  can  easily  see  that 
Rl,  R2A,  R3A,  and  R3C  caused  all  the  changes.  The  final  result 
is  also  printed  at  the  bottom.  The  diacritic  "  '  "  indicates 
stress;  indicates  devoicing. 
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A  set  oi*  rules  can  be  applied  to  a  tree  containing  more  than  one 
syntactic  unit.  If  so,  all  substructures  are  transformed  first, 
and  the  results  minus  the  syntactic  categories  of  the  substructure 
concatenated  before  transformation  on  the  next  level. 


E 
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Editing  and  Output  Capabilities 

The  phonological  rule  testing  system  has  certain  editing 
capabilities  built  into  it  as  well  as  having  access  to  the 
BBN  LISP  Editor  (Bobrow,  et  al.  1967)  both  of  which  can  be  used 
to  modify  the  list  structures  representing  rules,  trees,  and 
phonemes.  Insertions,  deletions  and  replacements  and  other  more 
sophisticated  changes  are  made  easily  after  a  very  short  learning 
period.  To  delete  any  defined  item  the  user  needs  type  only 

DEL  <Name><Type> 

where  <Name>  is  the  name  of  a  rule,  phoneme,  or  tree  and  <Type> 
is  either  RULE,  PHONEME  or  TREE. 

Definitions  are  printed  out  by  using  one  of  two  print  commands. 

The  instruction 

P  R1 

will  cause  the  system  to  respond  with  (using  the  above  definition) 


R1  s  <+  CNS  -  CONT  -  VOICE) 

<+  CONT  ♦  VOICE) 

/  <♦  VOC)  —  C+  VOC) 

The  instruction 
PR  <Type> 

will  cause  the  entire  inventory  of  the  <Type>  specified  to  be 
printed.  Using  the  definition  above,  PR  RULES  would  result  in 
the  output  shown  on  page  22. 
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The  instruction  EDIT  calls  the  BBN  LISP  editor  without  leaving 
tho  phonological  rule  tester  system.  The  command 

EDIT  R1  RULE 

allows  one  to  edit  the  rule  Rl. 


The  use  of  this  editor  has  been  described  elsewhere  and  will  not 
be  presented  here.  (Cf.  Bobrow,  et.al.  1967) 

From  the  phonological  system,  a  user  can  write  a  file  containing 
the  data  he  has  generated.  By  typing 

SAVE  <NAME> 

a  file  of  that  <NAME>  will  be  created  and  saved.  It  later  can  be 
loaded  to  initialize  the  system. 
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Conclusion 


In  designing  this  phonological  rule  tester,  we  have  endeavored 
to  include  capabilities  which  facilitate  stating  all  the  phono¬ 
logical  rules  to  far  described  in  the  literature,  and  some  only 
suggested  in  private  discussions.  Accordingly,  many  rules  may 
be  stated  in  a  variety  of  ways  within  the  limitations  imposed  by 
the  system,  and  it  is  the  linguist  himself  who  is  forced  to  set 
whatever  restriction*  he  feels  necessary.  Finally,  we  remark 
that  we  have  defined,  with  difficulty,  all  of  the  phonological 
rules  found  in  Chapter  5»  Summary  of  Rules,  in  The  Sound  Patterns 
of  English  (Chomsky  and  Halle,  1968)  and  are  currently  in  the 
process  of  verifying  the  claims  made  in  the  text. 
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